09. HTML File Structure

HTML File Structure

The Hypertext Markup Language (or HTML) is the language used to create documents for the World Wide Web.

Let's turn to Cameron Pittman , an instructor and Full Stack Engineer at Udacity, to introduce the basic structure of HTML files. The three short videos below are all you need to know to start web scraping. If you'd like to learn more, or are feeling like there are knowledge gaps you'd like to fill in, I encourage you to check out Cameron's "Intro to HTML and CSS" course. You can find it here .

As you're following along with the video, open up one of the HTML files you just downloaded in a text editor (like Sublime) and look for similarities in the HTML document that Cameron uses as an example. We'll do this together soon.

HTML Structure

HTML Structure

Trees

Trees

HTML and Trees

HTML and Trees

Rotten Tomatoes HTML

HTML File Structure

HTML File Structure Quiz

Below is an archived snippet of HTML code from the E.T. The Extra-Terrestrial Rotten Tomatoes page . The snippet contains the 'AUDIENCE SCORE' text and lacks styling (as you can see in the file preview on the right below).

Workspace

This section contains either a workspace (it can be a Jupyter Notebook workspace or an online code editor work space, etc.) and it cannot be automatically downloaded to be generated here. Please access the classroom with your account and manually download the workspace to your local machine. Note that for some courses, Udacity upload the workspace files onto https://github.com/udacity , so you may be able to download them there.

Workspace Information:

  • Default file path:
  • Workspace type: html-live
  • Opened files (when workspace is loaded): n/a

What type of HTML tag is the 72% audience score in?

SOLUTION: span